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ABSTRACT 



Stellar models generally use simple parametrizations to treat convection. The 
most widely used parametrization is the so-called "Mixing Length Theory" where 
the convective eddy sizes are described using a single number, a, the mixing- 
length parameter. This is a free parameter, and the general practice is to calibrate 
a using the known properties of the Sun and apply that to all stars. Using 
data from NASA's Kepler mission we show that using the solar-calibrated a is 
not always appropriate, and that in many cases it would lead to estimates of 
initial helium abundances that are lower than the primordial helium abundance. 
Kepler data allow us to calibrate a for many other stars and we show that for 
the sample of stars we have studied, the mixing-length parameter is generally 
lower than the solar value. We studied the correlation between a and stellar 
properties, and we find that a increases with metallicity. We therefore conclude 
that results obtained by fitting stellar models or by using population-synthesis 
models constructed with solar values of a are likely to have large systematic 
errors. Our results also confirm theoretical expectations that the mixing-length 
parameter should vary with stellar properties. 

Subject headings: stars: fundamental parameters — stars: interiors — stars: 
oscillations 



1. Introduction 

Accurately treating convective heat transport in stellar models is difficult. The structure 
and evolution of most stars is related to convective transport processes in their outer lay- 
ers. The transition from efficient convective transport in the deep envelope to the radiative 
atmospheric layers takes place in a region of inefficient convection where the temperature 
gradient is highly superadiabatic. The poorly known structure of this region remains one 
of the major uncertainties in stellar models. In one- dimensio nal calculations, this region 



is usually modeled using the "mixing length theory," or MLT (jBohm-Vitensd Il958l ) . This 
prescription assumes that one can approximate the full range of turbulent eddy-sizes by a 
typical size, and that an eddy on average travels a distance determined by the eddy size 
before losing its identity. This distance, is known as the "mixing length" and is usually 
defined as aH p , where a is known as the "mixing-length parameter," and H p is the local 
pressure scale height. Given the atmospheric structure, the approximation, when given the 
mixing length, fix the specific entropy in the deep convection zone, which in turn determines 



2 



the radius of the model. The model radius thus depends sensitively on the choice of the 
mixing-length parameter a, which is a free parameter that cannot be determined from the 
mixing length theory. 

The common practice when modeling stars to determine their structure and evolution 
is to use the solar value of a. We know the mass, radius, luminosity and age of the Sun 
precisely. Solar models are constructed by searching for a and the initial helium abundance 
Y that yield a model with the correct radius and luminosity at the Sun's age. Since masses, 
radii and ages are usually unknown for other stars, the solar approach is unfeasible and, 
hence, the solar value of a is used. 

The assumption of a fixed a has no a priori justification, and inde ed, there i s some 



evidence that the solar value of a does not always work for other stars. iLattanziol (119841 ) 



first noted that the radius of a Cen A cannot be reproduced usi ng the solar valu e of a . 



A combined astrometric and seismic study of the a Cen system by iDemarque et al.l (119861 ) 
confirmed this result as have more recent studies (e.g., Fernandes & Neuforge 1995; Miglio 
& Montalban 2005). Some studies of other binary systems have also suggested a mass- 
dependence of a (e.g., Ludwig & Salaris 1999; Morel et al. 2000; Lebreton et al. 2001; 
Lastennet et al. 2003, etc.). In particular, Yildiz et al. (2006) studied binaries in the Hyades 
cluster and suggested that a increases with stellar mass. Numerical simulations of stellar 
convection also suggest that convective properties vary with stellar parameters (e.g., Ludwig 
et al. 1999; Trampedach 2007; Trampedach & Stein 2011). 

While studies of binaries and clusters have suggested that the solar a is not always 
applicable, the situation for single field stars is not clear because of the lack of observational 
constraints. Asteroseismic data from space missions like CoRoT (Michel et al. 2008) and 
Kepler (Borucki et al. 2010) allow us to place independent constraints on the mass and 
radius of single stars. These constraints, along with the classical constraints of T e g and 
metallicity, allow us to constrain a. We already know that the oscillation spectra of some 
CoRoT and Kepler stars cannot be reproduced using the solar value of a (e.g., Metcalfe et 
al. 2010; Deheuvels & Michel 2011, Mathur et al. 2012). Detailed asteroseismic modeling 
of the solar analogs 16 Cyg A & B has also required the adoption of non-solar values of a 
(Metcalfe et al. 2012). 

In this study we use asteroseismic data obtained by NASA's Kepler mission to calibrate 
a for a sample of dwarfs and subgiants. Observational data and our analysis technique are 
described in § O We present our results and discuss their implications in § [31 

It should be noted that a is just a proxy for describing stellar convection, and in partic- 
ular a describes the entropy change between the surface and the deeper, isentropic, layers of 
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efficient convection. As a result, the value of a cannot be derived uniquely. It depends on the 
exact formulation of the mixing-length theory (see, e.g., Appendix A of Ludwig et al. 1999), 
as well as physics inputs that affect entropy e.g., atmospheric opacities, the temperature- 
optical depth relation (T-r) in the atmosphere, the equation of state and processes such as 
the gravitational settling of helium and heavy elements. Thus, the value of a in a given star 
needs to be examined in the context of the value of a needed to construct a solar model with 
the same physics. 

Note that conventional MLT assumes a fixed ratio between the distance traveled by 
an eddy and its size, and a is the only free parameter. Some approximations (see e.g., 
Arnett et al. 2010) leave the ratio as an adjustable parameter. We use the conventional 
Bohm-Virtense form of MLT and only adjust a. 

2. Method 

We used data obtained by the Kepler asteroseismology program (Gilliland et al. 2010) 
during its survey phase. The survey collected data on more than 2000 main-sequence and 
sub-giant stars and stellar oscillations were detected in about 500 (Chaplin et al. 2011). Of 
these we used a subset of 90 stars for which spectroscopic estimates of effective temperature 
T e ff and metallicity [M/H] are available from Bruntt et al. (2012). 

We used the average large separation Av and the frequency of maximum oscillation 
power i/ max for this work. The large separation scales approximately as the square root of 
the mean density of a star (Ulrich 1986; Christensen-Dalsgaard 1988) while the frequency of 
maximum power scales approximately as g/y/T^g (Brown et al. 1991; Kjeldsen & Bedding 
1995; Bedding & Kjeldsen 2003). The Av and f max values are those used by Verner et al. 
(2011) to verify the Kepler Input Catalog (Brown et al. 2011). 

First, Av, v max , T e g and [M/H] are used to estimate the mass and radius of each star. For 
this we used the grid-based Yale-Birmingham (YB) pipeline described by Basu et al. (2010) 
Gai et al. (2011) and Basu et al. (2012). We use four stellar-model grids for our work: 
models from the Yonsei-Yale (YY) isochrones (Demarque et al. 2004), and those of Dotter 
et al. (2008), Marigo et al. (2008) and Gai et al. (2010). The grids have been constructed 
with different physics inputs and modeling parameters, and with solar a consistent with the 
inputs to the grids. For subsequent calculations the average of the mass and radius estimates 
returned by the four grids is used. 

The properties of our final sample are shown in Fig. [H Note that our sample is quite 
restricted in terms of log g and we lack stars that are close to the base of the red giant branch; 
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in fact we have very few stars with \ogg < 4. This will make finding a log g dependence of 
the derived a difficult. There is the expected correlation between mass and T e s — the higher 
temperature stars are generally more massive. Although each mass range spans a range of 
log g, unsurprisingly the least evolved stars in our sample are also the least massive. Most 
stars in our sample have sub-solar metallicities. 

Starting with mass, radius, T c fj and metallicity, each star in our sample was modeled 
using the Yale Stellar Evolution Code (YREC; Demarque et al. 2008) in an iterative manner. 
In this mode, radius and T c q were specified and the code determined either a or the initial 
helium abundance (Y ) that yielded the specified radius at the given T e g for a given mass. 
In the former case Yo has to be specified, in the latter a has to be specified. 

The input physics consisted of the OPAL equation of state (Rogers & Nayfonov 2002), 
OPAL high-temperature opacities (Iglesias & Rogers 1996) supplemented with Ferguson 
et al. (2005) low temperature opacities. Nuclear reaction rates were from Adelberger et 
al. (1998), except for the u N(p,^) 15 reaction, which was fixed at the value of Formicola 
et al. (2004). Models did not include core overshoot or the diffusion and settling of helium 
and heavy elements. We used the Eddington T — r relation in the atmosphere. With the 
above physics, the solar-calibrated value of a is 1.690. Including gravitational settling of 
helium and metals would change that to 1.826. 

For the first set of calculations we assumed solar a for all stars and determined Y that 
would be needed to model the stars. We then performed two other sets of calculations: 
(1) we estimated a assuming that all stars have the solar value of Y = 0.278 (Serenelli & 
Basu 2010); (2) we estimated a assuming that Yo follows a simple chemical evolution model, 
Y = 0.245 + 1.54Z (Dotter et al. 2008). 

The iterative modeling process was repeated for 200 Monte Carlo realizations of R, M, T eff 
and [M/H] to estimate the uncertainties in Yo (or a). Parameters in each realization were 
randomly chosen from a Gaussian distribution centered on the measured central value, with 
the dispersion equal to the mean measurement error. To avoid uncertainties due to small 
number statistics, we show results of stars with at least 20 converged iterations. This re- 
quirement resulted in a final sample of about 55 stars. The median of the distribution 
of parameter values is quoted as the central value of the parameter, and uncertainties are 
determined as the 68% confidence limit of the distribution. 

The reliability of the derived a estimates depends on the reliability of the observations 
and those of the mass and radius estimates. There are indications that grid-based mass 
estimates can have systematic errors caused by differences in the input physics of the grids; 
however, these are smaller than those caused by uncertainties in T e g, [M/H], Au and z/ max 
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(Basu et al. 2012). Mathur et al. (2012) have shown that grid-based estimates of stellar 
masses and radii agree very well with those obtained from more detailed modeling of the 
oscillation frequencies, giving us confidence in the robustness of our mass and radius esti- 
mates. While detailed modeling of the oscillation spectrum is preferable for all stars, this is 
beyond the scope of this paper. 

3. Results and Discussion 

The initial helium abundance, Y , needed to construct models of the stars in our sample 
- assuming that they all have the solar value of a — is shown in Fig. [2j Note that for 
> 50% of our sample the Y estimate is less than the primordial value of Y p = 0.2477±0.0029 
(Peimbert et al. 2007). While the deficit is within la for some stars, there are stars with 
a > 3a deficit. The weighted average of Y of the sample is 0.227 ± 0.004. The weighted 
median is 0.223, both below Y p . It is, of course, highly unlikely that stars are born with 
less helium than was produced in the Big Bang. Given that the only parameter we could 
change in MLT is a, this implies that solar a does not properly approximate convective heat 
transport in these stars. Note that a change of physics inputs to the models will change the 
solar value of a, and the exercise repeated with the new solar a would give similar results. 

In Fig. [3^a) we show the value of a for our sample obtained assuming either the solar 
value of Y (red points), or the simple chemical evolution model of Y (black points). Note 
that a for most of the stars is less than the solar value for both cases The average value 
of a for this sample is 1.522 for solar Yq and 1.597 when the chemical-evolution model is 
used. In MLT, lower a implies less efficient convection. Thus MLT predicts that in the 
superadiabatic layers, convective energy transport in our sample is generally less efficient 
than that in the Sun. Since the results with the two choices of Yq are similar, in the 
subsequent discussions we only use a obtained with the chemical evolution model of Yq. 
Mathur et al. (2012) constructed detailed models to fit the mode frequencies of 22 Kepler 
stars using the Asteroseismic Modeling Portal (AMP; Metcalfe et al. 2009) There are 16 
stars in common with our sample. In Fig. [3](b) we show the differences between the Mathur 
et al. a values and the ones obtained in this work. The two a estimates agree well, mostly 
within la. Since the physics in the AMP models is different from ours, they obtain a solar 
a of 2.12. Thus to compare their results with ours, we have scaled the AMP results to our 
value of the solar a. In Fig[3^c) and (d) we show the variation of a with logg and [M/H]. 

In order to explore whether our a estimates are correlated with stellar properties, we 
first determined the simple Spearman rank correlation between a and different properties. 
The correlation coefficients are listed in Table 1. Also listed is the p- value, which is the 
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probability that the correlation is a chance occurrence. A small p therefore indicates a 
significant correlation. There thus appears to be significant correlation between metallicity 
and a. There also seems to be a mildly significant correlation between mass and a. The 
significance of the correlation of a with logg or T e g depends on whether or not we include 
the low-log g stars in our analysis. Table 1 lists the coefficient obtained for the entire sample, 
as well as that obtained by removing the lowest logg stars (logg < 3.8) in our sample. 

Since a depends simultaneously on a number of parameters, to get a better estimate of 
the correlations we perform a trilinear fit to a with the model 

a = a + b\ogg + c\ogT eS + d[M/n]. (1) 

Table 1 lists the coefficients and p- values, and Fig.HJshows the residuals, and partial residuals, 
of the fit to Eq. [TJ Note that the metallicity dependence is robust. The logg and T cS 
correlations are small and less statistically significant when the entire sample is used; these 
increase in significance, but change signs, when the logg cut is applied. The mass correlation 
seen in the Spearman correlation is most likely to be the result of the mass-T e g- and mass- 
log g correlation seen in Fig. 1 as indicated by the fact that the residuals of the fit to Eq. [1] 
do not show any trend with mass (Fig. H^a) and (e)). 

The metallicity dependence of a is relatively easy to understand. It is most likely caused 
by the temperature sensitivity of the H~ density and hence, of its dominant contribution 
to the optical continuum opacity. E.g., in the solar photosphere, ~ 50% of the electrons 
that form H~ are donated by metals, and the fraction increases steeply with height due to 
low-ionization potential elements like Na, Al, K, Ca and Cr. With a smaller amount of 
metals, the temperature sensitivity of the H~ density will therefore increase. This in turn 
will increase the contrast between up- and down-flows, and especially increase the range 
of depths over which the down-flows will be cooled. This results in a larger entropy jump 
between the surface and the deeper layers, meaning a lower convective efficiency (a smaller a 
in the context of MLT). Since the average metallicity of our sample is sub-solar, we believe 
that this metallicity dependence accounts for the lower-than-solar average value of a for our 
sample. 

The lack of a significant correlation between a and T e g or logg is surprising. This is 
most likely the result of the limited and skewed range of logg of our sample, and is confirmed 
by the change of the sign of the correlation when the log g cutoff is applied. A larger sample 
should resolve these issues, in particular, data on giants should help determine the logg 
dependence properly. Piau et al. (2011), using a sample of red giants with radii known from 
interferometry, have shown that the red-giant models require sub-solar values of a to explain 
the observations; however, they did not address the dependence of a on stellar parameters. 
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As noted earlier, a is just a proxy for describing convection in stars. Although it is 
known that using such a proxy does not reproduce properties of the stellar near-surface 
layers correctly, MLT remains a practical tool in stellar modeling. An a type parameter 
can also be derived from numerical simulations of stellar convection (e.g., Ludwig et al. 
1999; Trampedach et al. 1999; Trampedach 2007). At present it is difficult to compare 
these results with ours since the simulations were for solar composition, and the metallicity 
dependence of our results is fairly strong. However, there do seem to be some differences 
between our findings and the simulations. At a given log g, Ludwig et al. (1999) found a to 
decrease with increasing T ef f for dwarfs in their 2D simulations and they find a to decrease 
with logg. A similar behavior was seen in the 3D simulations of Trampedach (2007). Our 
a-T e s correlation agrees with theirs, but the logg one does not when we examine the entire 
sample; when we apply the log g cutoff the reverse becomes true, the logg correlation agrees, 
the T e ff correlation does not. 

As more detailed asteroseismic data become available from Kepler, and they are mod- 
eled, we will be able to reduce the uncertainties in the mixing-length parameters needed to 
model the stars, and the dependence of the properties of near-surface convection, including 
the corresponding value of a, on stellar parameters will become clearer. As it is, our results 
have important implications for the different branches of astrophysics that depend on fitting 
stellar models. The usual way to determine stellar properties is through spectroscopic or 
photometric analyses of the star combined with fitting to grids of stellar models to obtain 
masses and radii (e.g., Takeda et al. 2007) or ages (e.g., J0rgensen & Lindegren 2005). At 
a given metallicity, a lower a makes the evolutionary track of a given mass redder than its 
higher-a counterpart, and thus the properties of a star obtained using models constructed 
with a solar a will be quite different from those obtained with a sub-solar a. Although the 
T e ff change with a is really a result of a radius change, it appears as a change in the estimated 
mass of star being fitted (Basu et al. 2011). Consequently, we would be underestimating 
the mass of a sub-solar metallicity star if we use models constructed with the solar value 
of a. Stellar population and spectral synthesis models also use a single (usually the solar 
calibrated) value of a (see e.g. Coelho et al. 2007), and our results now show that the 
uncertainties in a need to be added to the error budget of results that use those models. 

We thank the anonymous referee for comments that have improved this paper. Funding 
for the Kepler mission is provided by NASA's Science Mission Directorate. This work was 
partially supported by NSF grant AST-1 105930 and NASA grant NNX09AJ53G. We also 
thank all other funding councils and agencies that have supported the activities of Working 
Group 1 of the Kepler Asteroseismic Science Consortium. 
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Table 1: Correlations and the associated p value between a and stellar parameters. 
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Fig. 1. — (a) Av of the stars plotted against T e g. The gray lines are tracks for masses 0.9 
M Q to 1.4 M Q ; (b) Derived log g plotted against T e fj; (c) The derived mass as a function of 
T e g] and (d) the derived mass plotted as a function of the derived logg. In panels (b)-(d) 
the color scale denotes metallicity (dex). 
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Fig. 2. — The initial helium abundance, Y Q , of stars in our sample obtained with the solar 
value of a. The black line marks the primordial Big Bang nucleosynthesis helium fraction, 
Y p = 0.2477 ± 0.0029. The pink line is the weighted mean of the sample, and the blue is the 
weighted median. The dashed lines show la errors. 
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Fig. 3. — (a) The derived mixing length assuming the solar value of Yq (red) and for the 
chemical evolution model of To (black). The blue line marks solar a. (b) The differences 
between a obtained by Mathur et al. (2012) by fitting individual frequencies and our a 
estimates of the same stars, (c) a plotted as a function of log g, and (d) a plotted as a 
function of [M/H]. Only a estimated with the chemical evolution model of Y are plotted in 
(c) and (d). 
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Fig. 4.— The residuals and partial residuals for the trilinear fit (Eq. [p. Panels (a)-(d) are 
results when the entire sample is used. Panels (e)-(h) are results for logg > 3.8; the low log g 
points are however still shown. The intercept a changes when we apply the log g cutoff. The 
blue lines are blogg [panels (b), (f)], clogT e fj [Panels (c),(g)] and d[M/H] [panels (d),(h)]. 
Note that the logg cut-off makes the correlations tighter and that the [M/H] relation remains 
almost unchanged with and without the logg cutoff. 
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